Method and system for voice input at an ATM without audibly revealing a selected transaction

ABSTRACT

Embodiments disclosed herein generally relate to a system and method for transacting at an ATM. A computing system receives a request from the ATM to provide audio commands for an input having one or more components. The computing system generates an audio message for a component of the one or more components. The computing system receives an indication from the user to select an entry from the set of candidate entries as the component. The indication from the user does not include a recitation of the first entry. The computing system maps the indication to a respective candidate in the set of candidate entries. The computing system generates a verification audio message for the user to audibly verify a correct mapping between the indication and the respective candidate in the set of candidate entries. The computing system receives an audible verification from the user.

CROSS-REFERENCE TO RELATED APPLICATIONS

This is a continuation of U.S. application Ser. No. 16/002,358, filedJun. 7, 2018, the content of which is hereby incorporated by referencein its entirety.

FIELD OF THE DISCLOSURE

The present disclosure generally relates to a method and a system forproviding voice input at an automated teller machine (ATM).

BACKGROUND

Conventional automated teller machines (ATMs) require human interventionor human interface with a display of the ATM. For example, when anaccount holder wants to withdraw money from his or her account, theaccount holder interfaces with the ATM's display to enter the accountholder's credentials (e.g., PIN) number, and then navigate through apre-set number of screens to withdraw a desired amount of funds from theaccount. Accordingly, the ATM provides a gateway for the end user tomanage or manipulate his or her account with a financial servicesorganization.

SUMMARY

Embodiments disclosed herein generally relate to a system and method fortransacting at an automated teller machine (ATM). A computing systemreceives a request to provide audio commands for an input having one ormore components. The computing system generates an audio message for acomponent of the one or more components. The audio message includes aset of candidate entries. The computing system receives an indicationfrom the user to select an entry from the set of candidate entries asthe component. The indication from the user does not include arecitation of the first entry. The computing system maps the indicationto a respective candidate in the set of candidate entries. The computingsystem generates a verification audio message for the user to audiblyverify a correct mapping between the indication and the respectivecandidate in the set of candidate entries. The computing system receivesan audible verification from the user.

In some embodiments, the computing system generates a further audiomessage for a further component of the one or more components. Thefurther audio message includes a further set of candidate entries. Thecomputing system receives a further indication from the user to select afurther entry from the further set of candidate entries as the furthercomponent. The further indication from the user does not include arecitation of the further entry. The computing system maps the furtherindication to a further respective candidate in the further set ofcandidate entries. The computing system generates a further verificationaudio message for the user to audibly verify a correct mapping betweenthe further indication and the further respective candidate in thefurther set of candidate entries. The computing system receives afurther audible verification from the user.

In some embodiments, the first set of candidate entries and the furtherset of candidate entries are equivalent.

In some embodiments, receiving the request to provide audio commands forthe input having one or more components in the ATM includes thecomputing system receiving a request to enter a personal identificationnumber (PIN) that includes one or more digits. Each digit corresponds toa respective component of the one or more components.

In some embodiments, receiving the request to provide audio commands forthe input having one or more components in the ATM includes thecomputing system receiving a request to enter a withdrawal amount.

In some embodiments, generating the audio message for the component ofthe one or more components is triggered by identifying an auxiliaryconnection is established.

In some embodiments, the computing system determines that theestablished auxiliary connection is broken. The computing system haltsthe request to provide audio commands until the auxiliary connection isre-established.

In another embodiment, a system is disclosed herein. The system includesa processor and a memory. The processor is in selective communicationwith a microphone and a speaker. The memory has programming code, which,when executed by the processor, performs an operation of transacting atan automated teller machine (ATM). The operation includes receiving arequest to provide audio commands for an input having one or morecomponents. The operation further includes identifying that acommunication link has been established between the processor and themicrophone and speaker. The operation further includes generating anaudio message for a component of the one or more components, the audiomessage comprising a set of candidate entries. The operation furtherincludes delivering the audio message to the user via the speaker. Theoperation further includes receiving an indication from the user via themicrophone to select an entry from the set of candidate entries as thecomponent. The indication from the user does not include a recitation ofthe first entry. The operation further includes mapping the indicationto a respective candidate in the set of candidate entries. The operationfurther includes generating a verification audio message for the user toaudibly verify a correct mapping between the indication and therespective candidate in the set of candidate entries. The operationfurther includes delivering the verification audio message to the uservia the speaker. The operation further includes receiving an audibleverification from the user via the microphone.

In some embodiments, the operation further includes generating a furtheraudio message for a further component of the one or more components, thefurther audio message comprising a further set of candidate entries. Theoperation further includes delivering the further audio message to theuser via the speaker. The operation further includes receiving a furtherindication from the user via the microphone to select a further entryfrom the further set of candidate entries as the further component. Thefurther indication from the user does not include a recitation of thefurther entry. The operation further includes mapping the furtherindication to a further respective candidate in the further set ofcandidate entries. The operation further includes generating a furtherverification audio message for the user to audibly verify a correctmapping between the further indication and the further respectivecandidate in the further set of candidate entries. The operation furtherincludes delivering the verification audio message to the user via thespeaker. The operation further includes receiving a further audibleverification from the user via the microphone.

In some embodiments, the first set of candidate entries and the furtherset of candidate entries are equivalent.

In some embodiments, the processor and memory and positioned within theATM.

In some embodiments, the operation of receiving the request to provideaudio commands for the input having one or more components includesreceiving a request to enter a personal identification number (PIN) thatcomprises one or more digits. Each digit corresponds to a respectivecomponent of the one or more components.

In some embodiments, the operation of receiving the request to provideaudio commands for the input having one or more components includesreceiving a request to enter a withdrawal amount.

In some embodiments, the operation further includes determining that thecommunication link is broken and halting the request to provide audiocommands until the communication link is re-established.

In some embodiments, the memory further includes a Bluetooth module forselective communication with a client device of the user.

In another embodiment, a method of transacting at an automated tellermachine (ATM) is disclosed herein. A computing system determines that anauxiliary connection is established between a user and the ATM. Thecomputing system receives an audio request from the user to input apersonal identification number (PIN). The PIN includes one or moredigits. The computing system generates an audio message for a digit ofthe one or more digits. The audio message comprising a set ofcandidates. The computing system provides the audio message to the uservia the auxiliary connection. The computing system receives anindication from the user to select an entry from the set of numbers as adigit of the one or more digits of the PIN. The indication from the userdoes not include a recitation of the entry. The computing systemcontinues to generate subsequent audio messages for subsequent digits ofthe one more digits, providing the subsequent audio messages to theuser, and receiving subsequent indications from the user to selectsubsequent entries from the set of numbers as a respective digit of theone or more digits of the PIN until each digit of the one or more digitsof the PIN are entered.

In some embodiments, the computing system determines that the auxiliaryconnection between the user and the ATM is broken and halts thegenerating of the audio message until the auxiliary connection isre-established.

In some embodiments, wherein the set of candidates are transmitted tothe user in a pattern that is a sequential array of numbers.

In some embodiments, a duration of a gap between each candidate in theset of candidates varies.

In some embodiments, receiving the indication from the user includes areply message comprising one or more of back one, forward one, and stop.

BRIEF DESCRIPTION OF THE DRAWINGS

So that the manner in which the above recited features of the presentdisclosure can be understood in detail, a more particular description ofthe disclosure, briefly summarized above, may be had by reference toembodiments, some of which are illustrated in the appended drawings. Itis to be noted, however, that the appended drawings illustrated onlytypical embodiments of this disclosure and are therefore not to beconsidered limiting of its scope, for the disclosure may admit to otherequally effective embodiments.

FIG. 1 is a block diagram illustrating an exemplary computingenvironment, according to one embodiment.

FIG. 2 is a flow diagram illustrating an exemplary method of receivingvoice input at an ATM, according to one embodiment.

FIG. 3 is a flow diagram illustrating an exemplary method of receivingvoice input at an ATM, according to one embodiment.

FIG. 4 is a flow diagram illustrating an exemplary method of receivingvoice input at an ATM, according to one embodiment.

FIG. 5 illustrates an exemplary computing environment, according to oneembodiment.

To facilitate understanding, identical reference numerals have beenused, where possible, to designate identical elements that are common tothe figures. It is contemplated that elements disclosed in oneembodiment may be beneficially utilized on other embodiments withoutspecific recitation.

DETAILED DESCRIPTION

The present disclosure generally relates to a method and a system forproviding voice input at an automated teller machine (ATM). Inparticular, when an account holder wants to interact with his or heraccount (e.g., withdraw funds, deposit funds, check balance, etc.), anATM may provide the bridge between the user and the financial servicesorganization. For example, the account holder may interface with theATM's display to enter the account holder's credentials (e.g., PIN)number, and then navigate through a pre-set number of screens towithdraw a desired amount of funds from the account. Conventional ATMs,however, are not well equipped at catering to account holders that maybe vision impaired.

The one or more techniques described herein addresses this shortcomingof conventional ATMs by reconfiguring the ATM's controller to allow forvoice input in place of the traditional screen navigating. The user mayrequest voice input upon connecting an audio input/output (I/O) deviceto the ATM. For example, the user may request voice input upon the ATMdetecting a wireless or wired connection to the user's headphones. Byrestricting voice input to those situations in which an audio I/O deviceis detected, the ATM can broadcast audio commands to the user discretely(i.e., via headphones). Similarly, a user may provide voice input to theATM discretely (e.g., through generic responses).

The term “user” as used herein includes, for example, a person or entitythat owns a computing device or wireless device; a person or entity thatoperates or utilizes a computing device; or a person or entity that isotherwise associated with a computing device or wireless device. It iscontemplated that the term “user” is not intended to be limiting and mayinclude various examples beyond those described.

FIG. 1 is a block diagram illustrating a computing environment 100,according to one embodiment. Computing environment 100 may include anautomated teller machine (ATM) 102 and financial services server 104communicating via network 105. ATM 102 may be operated by one or moreusers. For example, a user may operate ATM 102 to perform one or morefunctions associated with a user's bank account. Such functions mayinclude, for example, withdrawing funds, depositing funds, transferringfunds, checking an account balance, and the like. In operation, a usermay obtain ATM 102 functionality via an ATM card assigned to the user.For example, a user may obtain ATM 102 access by inserting the user'sATM card and entering a personal identification number (PIN) preset bythe user.

Network 105 may be of any suitable type, including individualconnections via the Internet, such as cellular or Wi-Fi networks. Insome embodiments, network 105 may connect terminals, services, andmobile devices using direct connections, such as radio frequencyidentification (RFID), near-field communication (NFC), Bluetooth™,low-energy Bluetooth™ (BLE), Wi-Fi™, ZigBee™, ambient backscattercommunication (ABC) protocols, USB, WAN, or LAN. Because the informationtransmitted may be personal or confidential, security concerns maydictate one or more of these types of connection be encrypted orotherwise secured. In some embodiments, however, the information beingtransmitted may be less personal, and therefore, the network connectionsmay be selected for convenience over security.

Network 105 may include any type of computer networking arrangement usedto exchange data. For example, network 105 may be the Internet, aprivate data network, virtual private network using a public networkand/or other suitable connection(s) that enables components in computingenvironment 100 to send and receiving information between the componentsof environment 100.

ATM 102 may include a controller 103 and display 115. Controller 103 maybe a computing system that is configured to facilitate communicationbetween financial services server 104 and the user. Controller 103 mayinclude audio command agent 110, Voice-to-Text agent 109, and naturallanguage processor (NLP) 111. Audio command agent 110 may be configuredto transmit audio messages to a user of ATM 102 as well as receive audiocommands from user. Audio command agent 110 may interface with an audioinput/output (I/O) device 108. Such audio I/O devices may include, forexample, a device that includes at least speakers such that the user ofI/O device 108 can hear audio commands from ATM 102 without a passerbybeing able to interpret the audio commands. Further, such I/O devices108 may include a microphone configured to capture the user'ssounds/words. In some embodiments, ATM 102 may include a microphoneinstalled therein. In some embodiments, a user's headphones may includea microphone and speakers to interact with ATM 102.

Voice-to-text agent 109 may be configured to decipher one or more audiomessages transmitted from user to ATM 102. For example, Voice-to-textagent 109 may translate the received audio messages to text forsubsequent processing by controller 103. For example, NLP 111 may beconfigured to process the translated text-based message. Upon processingthe text-based message, audio command agent 110 may be furtherconfigured to generate audio commands to be transmitted to user.

Audio command agent 110 may be enabled responsive to audio command agent110 detecting a connection with audio I/O device 108. In someembodiments, the connection between audio I/O device 108 and audiocommand agent 110 may be a wired connection (e.g., an auxiliaryconnection, a USB-C connection, lightening connection, etc.). In someembodiments, the connection between audio I/O device 108 and audiocommand agent 110 may be wireless (e.g., Bluetooth, ZigBee, NFC, etc.).

In operation, ATM 102 may receive a request from user for audio commandsupon audio command agent 110 detecting an audio connection with an I/Odevice 108 of the user. Upon detecting this connection, audio commandagent 110 may provide user with one or more command options. Such audiocommands may be beneficial, for example, in situations when avision-impaired user is attempting to operate ATM 102. However, ratherthan audibly broadcast audio commands to the user such that passerbyscan hear and decipher the communication between the user and ATM 102,the present system utilizes audio I/O device 108 to discretelycommunicate options to the user such that passerbys would not be able tohear the options communicated to the user. Further, through audio I/Odevice 108, ATM 102 may also discretely receive confirmations orrequests from the user in the form of simple responses that may bemapped to options communicated to the user. For example, rather than theuser audibly reciting his or her PIN, audio command agent 110 may recitea list of candidate numbers to the user for a first digit of a user'sPIN and wait for the user to select an option from the list.

Upon receipt of a request from ATM 102, ATM 102 may communicate therequest to financial services server 104. The request may include, at aminimum, the request from the user as well as a user identifier. Theuser identifier may include the user's account number, social securitynumber, debit card number, credit card number, address, and the like.Financial services server 104 may include an account handler 114.Account handler 114 may be configured to receive the request from ATM102 and manage an account associated with the user to carry out therequest. Account handler 114 may communicate with database 106. Database106 may include one or more user profiles 116. Each user profile 116 mayinclude one or more accounts 118. Such accounts may include, forexample, credit card accounts, savings accounts, checking accounts, andthe like.

In operation, for example, upon receiving a user identifier (e.g.,account number) and PIN from ATM 102, account handler 114 may carry outan authentication request by querying the database 106 with the user'saccount number to determine whether the received PIN matches the PIN inuser profile 116. Upon authenticating the user, financial servicesserver 104 may transmit a confirmation back to ATM 102, such that ATM102 can provide the user with further prompts for further requests. Suchauthentication of the user may establish a “session” between the userand the financial services server 104.

In another operation, for example, upon a session being establishedbetween the user and financial services server 104, account handler 114may carry out a withdrawal request from the user by querying database106 to determine whether there are sufficient funds in user's account118 to satisfy the request. Upon determining there are sufficient funds,financial services server 104 may transmit a confirmation message to ATM102. In some embodiments, along with the confirmation message, financialservices server 104 may further transmit a message containing an updatedbalance of the user's account 118.

FIG. 2 is a flow diagram illustrating an exemplary method 200 ofreceiving voice input at an ATM (e.g., ATM 102), according to oneembodiment. Method 200 begins at step 202. At step 202, ATM 102 mayreceive a signal from a trigger. In some embodiments, the signal, forexample, may be a request to access an account associated with the user.In some embodiments, the trigger may be a manual input on a screen ofATM 102. In some embodiments, the trigger may be an audio connectionestablished between ATM 102 and audio I/O device 108. For example, theaudio connection may be a wired connection between ATM 102 and audio I/Odevice 108 or a wireless connection between ATM 102 and audio I/O device108.

At step 204, ATM 102 may determine whether an audio connection isestablished between ATM 102 and user. For example, ATM 102 may determinewhether an audio I/O device 108 has been connected to audio commandagent 110 via a wired or wireless connection. If, at step 204, ATM 102determines that there is not an audio connection established between ATM102 and the user, method 200 proceeds to step 206. At step 206, ATM 102may prompt the user to establish the audio connection to proceed.

If, however, at step 204, ATM 102 determines that there is an audioconnection established between ATM 102, then method 200 proceeds to step208. At step 208, ATM 102 may generate an audio message to betransmitted to user. ATM 102 may generate an audio message for a firstcomponent of user's PIN. The audio message may include for example, astring of numbers to be recited to the user. The string of numbers mayinclude single digit numbers that are possible candidates for a firstcomponent of user's PIN. In some embodiments, the string of numbers arein numerical order (e.g., 1, 2, 3, . . . 9). In some embodiments thestring of numbers are recited in a pre-defined order (e.g., 2, 4, 6, 8,1, 3, 5, 7, 9). In some embodiments, the string of numbers is recited ina random (or pseudorandom) order.

At step 210, ATM 102 may transmit the audio message to the user via theaudio connection. In some embodiments, the message may be transmitted tothe user, such that the user can adjust the speed at which eachcandidate (i.e., number) is recited to the user. In some embodiments,the message may be transmitted to the user at a pre-defined speed. Insome embodiments, the speed between candidates may vary. For example,the gap between a first candidate and a second candidate may be aboutone second, while the gap between the second and a third candidate maybe about a half second.

At step 212, ATM 102 may receive, from the user, an indication to selectan entry in the set of candidates. For example, audio command agent 110may receive the indication from the user via a microphone (either theATM's or the audio I/O device's). Such indication, for example, mayinclude a simple “yes” following a recitation of a candidate in themessage. For example, following candidate “2” in the message, the usermay respond with “yes.”

At step 214, ATM 102, upon receiving the indication, may map theindication to a candidate in the set of candidates in the message. Forexample, audio command agent 110 may receive and parse the indicationfrom the user. Upon parsing the indication, audio command agent 110 mayidentify to which candidate in the set of candidates the indicationcorresponds. In some embodiments, audio command agent may identify towhich candidate in the set of candidates the indication corresponds bypairing the time in which ATM 102 receives the response from the userwith the duration of time since the audio message started to play.

At step 216, ATM 102 may generate a confirmation message to betransmitted to the user. For example, audio command agent 110 maygenerate a confirmation message such that the user can audibly confirmthat audio command agent 110 correctly mapped the indication to aparticular candidate. Continuing with the above example, upon mappingthe user's indication to “2,” audio command agent 110 may generate amessage that recites “You entered 2. Is this correct?”

At step 218, ATM 102 may transmit the confirmation message to the uservia the audio connection. At step 220, ATM 102 may receive aconfirmation from the user. For example, user may have listened to theconfirmation message via audio I/O device 108, and, upon reviewing theconfirmation message, affirmed that the mapping between the user'sindication and the candidate is correct. In some embodiments, ATM 102may receive a corrective response from the user. For example, uponreceiving a confirmation message from ATM 102 seeking confirmation onthe selection of “1” in the string “2 . . . 5 . . . 7 . . . 1,” the usermay respond with “back three” thus notifying ATM 102 that the mappingwas incorrect, and what the correct mapping should be.

At step 222, ATM 102 may determine whether there are remainingcomponents in user's PIN. For example, audio command agent 110 maygenerate a message asking the user whether there are additionalcomponents to user's PIN. Such message may include, for example, “Do youwish to submit an additional component?” Upon receiving an indicationthat there are remaining components to user's PIN (e.g., a “Yes” to theabove question), method 200 reverts to step 208, and audio command agent110 may generate a subsequent audio message to be transmitted to user.For example, audio command agent 110 may generate an audio message for afurther component of user's PIN. If, however, at step 222, ATMdetermines that there are no remaining components to user's PIN, thenmethod 200 may end.

FIG. 3 is a flow diagram an exemplary method 300 of transacting at anATM (e.g., ATM 102) using audio commands, according to one embodiment.For example, method 300 may be performed subsequent to ATM 102 receivinga PIN for the user, such as described above in conjunction with FIG. 2.

Method 300 begins at step 302. At step 302, ATM 102 may transmit the PINto financial services server. For example, after receiving audibleindications from user to input a PIN, ATM 102 may transmit a messagecontaining the user's PIN to financial services server 104. Such messagemay include the user's PIN as well as a user identifier. The useridentifier may include a user's name, account number, debit card number,credit card number, social security number, phone number, and the like.

At step 304, financial services server 104 may receive the PIN from ATM102. At step 306, financial services server 104 may determine whetherthe PIN received from ATM 102 is a valid PIN. In some embodiments,account handler 114 may query database 106 with the received useridentifier to identify a user profile 116 associated therewith. Based onthe identified user profile 116, account handler 114 may verify the PINby determining whether the PIN matches a PIN associated with an account118 in user's profile 116.

At step 308, financial services server 104 may transmit a confirmationmessage to ATM 102 that confirms the received PIN. At step 310, ATM 102may receive the confirmation from financial services server 104. Uponreceiving the confirmation from financial services server 104, at step312, ATM 102 may establish a session between the user and financialservices server 104. Such session may allow the user to accessfunctionality of financial services server 104 via ATM 102. For example,the user may be able to withdraw funds, deposit funds, transfer funds,check account information, and the like, during the session.

At step 314, ATM 102 may generate an audio message presenting the userwith one or more action items. For example, audio command agent 110 maygenerate an audio message that provides the user with one or moreoptions. Such options may include, for example, “check balance,”“deposit check,” “deposit cash,” “withdrawal,” “transfer funds,”“purchase stamps,” and the like. In some embodiments, the audio messagegenerated by audio command agent 110 may present the options to the userin a pre-defined order, such as that provided above. In someembodiments, the audio message generated by audio command agent 110 maypresent the options to the user in a random (or pseudorandom) order. Inthis manner, passerbys will not be able to predict the option selectedby the user. In some embodiments, ATM 102 may allow the user to specifythe preference in which the user is presented with candidates. Forexample, the user may store in his or her account, one or morepreference related to audio commands from ATM 102.

At step 316, ATM 102 may transmit the audio message to the user. Forexample, audio command agent 110 may transmit the audio message with theone or more options to the user via the audio connection. At step 318,ATM 102 may receive an indication from the user directed to a selectionof an option in the audio message. For example, audio command agent 110may receive the indication from the user via a microphone (either theATM's or the audio I/O device's). Such indication, for example, mayinclude a simple “yes” following a recitation of an option in themessage. For example, following the option “Deposit Funds” in themessage, the user may respond with “yes.” In another example, followingcandidates “Deposit Funds . . . Withdraw Funds,” the user may respondwith “go back one.”

At step 320, ATM 102, upon receiving the indication, may map theindication to a candidate in the set of candidates in the message. Forexample, audio command agent 110 may receive and parse the indicationfrom the user. Upon parsing the indication, audio command agent 110 mayidentify to which candidate in the set of candidates the indicationcorresponds. Continuing with the above example, upon receiving anindication of the form “go back one,” audio command agent 110 may mapthe indication to “Deposit Funds” in the message “Deposit Funds . . .Withdraw Funds.”

At step 322, ATM 102 may initiate the action corresponding to the mappedoption. For example, ATM 102 may work in conjunction with financialservices server 104 to access user's account, such that user may, forexample, deposit funds to his or her account.

FIG. 4 is a flow diagram illustrating a method 400 of transacting at anATM (e.g., ATM 102) using audio commands, according to one embodiment.For example, method 400 may be performed in conjunction with the method200, 300 discussed above.

Method 400 begins at step 402. At step 402, ATM 102 may receive arequest for audio commands. In some embodiments, the request for audiocommands may take the form of an audio connection established betweenthe ATM 102 and the user via audio I/O device 108. In some embodiments,the request for audio commands may take the form of a manual input via agraphical user interface displayed to the user.

At step 404, ATM 102 may identify that an audio connection isestablished with the user. In some embodiments, the audio connection maybe a wired connection between the user and ATM 102. In some embodiments,the audio connection may be a wireless connection between the user andATM 102.

At step 406, ATM 102 may transmit an audio message to the user via theaudio connection. In some embodiments, the audio connection may includeone or more candidates for entry of a PIN by the user. In someembodiments, the audio connection may include one or more options that auser my select.

At step 408, ATM 102 may determine that the audio connection with theuser is broken. In some embodiments, ATM 102 may determine that thewired connection with the user is broken (e.g., the auxiliary cord wasremoved from an auxiliary port of ATM 102). In some embodiments, ATM maydetermine that the wireless connection with the user is broken (e.g.,the Bluetooth connection with the user is interrupted).

At step 410, ATM 102 may cease audio commands upon determining that theaudio connection with the user is broken. In some embodiments, ATM 102may immediately terminate a session established with the user upondetermining that the audio connection with the user is broken. In someembodiments, ATM 102 may terminate the session after a predeterminedperiod has elapsed. In some embodiments, method 400 may include step412. At step 412, upon determining that the audio connection with theuser is broken, ATM 102 may prompt user to reestablish the audioconnection. In some embodiments, ATM 102 may prompt user by displaying amessage via a graphical user interface. In some embodiments, ATM 102 mayprompt the user audibly, by transmitting a message to the user viaspeakers in ATM 102.

FIG. 5 is a block diagram illustrating an exemplary computingenvironment 500, according to one embodiment. Computing environment 500includes computing system 502 and computing system 552. Computing system502 may be representative of controller 103 of ATM 102. Computing system552 may be representative of financial services server 104.

Computing system 502 may include a processor 504, a memory 506, astorage 508, and a network interface 510. In some embodiments, computingsystem 502 may be coupled to one or more I/O device(s) 522. One or moreI/O devices 522 may include at least a speaker 524 and a microphone 526.In some embodiments, speaker 524 and microphone 526 may be packaged in asingle unit. In some embodiments, speaker 524 and microphone 526 may bepackaged separately.

Processor 504 may retrieve and execute program code 516 (i.e.,programming instructions) stored in memory 506, as well as stores andretrieves application data. Processor 504 may be included to berepresentative of a single processor, multiple processors, a singleprocessor having multiple processing cores, and the like. Networkinterface 510 may be any type of network communications enablingcomputing system 502 to communicate externally via computing network505. For example, network interface 510 allows computing system 502 tocommunicate with computer system 552.

Storage 508 may be, for example, a disk storage device. Although shownas a single unit, storage 508 may be a combination of fixed and/orremovable storage devices, such as fixed disk drives, removable memorycards, optical storage, network attached storage (NAS), storage areanetwork (SAN), and the like.

Memory 506 may include a voice-to-text agent 509, natural languageprocessor (NLP) 511, audio command agent 512, operating system 514,program code 516, and wireless connectivity module 518. Program code 516may be accessed by processor 504 for processing (i.e., executing programinstructions). Program code 516 may include, for example, executableinstructions configured to perform steps discussed above in conjunctionwith FIGS. 2-4. As an example, processor 504 may access program code 516to perform operations for transacting at an ATM (e.g., ATM 102) usingaudio commands. Audio command agent 512 may be configured facilitatetransacting at ATM 102 using audio commands. For example, audio commandagent 512 may be configured to transmit audio messages to a user of ATM102 as well as receive audio commands from user. Audio command agent 110may interface with an audio input/out device (e.g., speaker 524,microphone 526, audio I/O device 108). Voice-to-text agent 509 and NLP511 may work in conjunction with audio command agent 512. Voice-to-textAgent 509 may be configured to decipher one or more audio messagestransmitted from user to computing system 502. For example,voice-to-text agent 509 may translate the received audio messages totext for subsequent processing by processor 504. For example, NLP 511may be configured to process the translated text-based message. Uponprocessing the text-based message, audio command agent 512 may befurther configured to generate audio commands to be transmitted to user.Wireless connectivity module 518 may be configured to allow computingsystem 502 to wirelessly connect with one or more I/O devices 522. Insome embodiments, wireless connectivity module 518 may be one of aBluetooth module, NFC module, ZigBee module, and the like.

Computing system 552 may include a processor 554, a memory 556, astorage 558, and a network interface 560. In some embodiments, computingsystem 552 may be coupled to one or more I/O device(s) 572 (e.g.,keyboard, mouse, etc.). In some embodiments, computing system 552 may bein communication with database 106.

Processor 554 retrieves and executes program code 566 (i.e., programminginstructions) stored in memory 556, as well as stores and retrievesapplication data. Processor 554 is included to be representative of asingle processor, multiple processors, a single processor havingmultiple processing cores, and the like. Network interface 560 may beany type of network communications allowing computing system 552 tocommunicate externally via computing network 505. For example, networkinterface 560 is configured to enable external communication withcomputing system 502.

Storage 558 may be, for example, a disk storage device. Although shownas a single unit, storage 558 may be a combination of fixed and/orremovable storage devices, such as fixed disk drives, removable memorycards, optical storage, network attached storage (NAS), storage areanetwork (SAN), and the like.

Memory 556 may include account handler 562, operating system 564, andprogram code 566. Account handler 562 may be configured to receive oneor more requests from computing system 502 and manage an accountassociated with the user to carry out the request. Account handler 562may communicate with database 106. Database 106 may include one or moreuser profiles. Each user profile may include one or more accounts. Suchaccounts may include, for example, credit card accounts, savingsaccounts, checking accounts, and the like. Program code 566 may beaccessed by processor 554 for processing (i.e., executing programinstructions). Program code 566 may include, for example, executableinstructions for communicating with computing system 502 to allow a userto transact at an ATM using audio commands.

While the foregoing is directed to embodiments described herein, otherand further embodiments may be devised without departing from the basicscope thereof. For example, aspects of the present disclosure may beimplemented in hardware or software or a combination of hardware andsoftware. One embodiment described herein may be implemented as aprogram product for use with a computer system. The program(s) of theprogram product define functions of the embodiments (including themethods described herein) and can be contained on a variety ofcomputer-readable storage media. Illustrative computer-readable storagemedia include, but are not limited to: (i) non-writable storage media(e.g., read-only memory (ROM) devices within a computer, such as CD-ROMdisks readably by a CD-ROM drive, flash memory, ROM chips, or any typeof solid-state non-volatile memory) on which information is permanentlystored; and (ii) writable storage media (e.g., floppy disks within adiskette drive or hard-disk drive or any type of solid staterandom-access memory) on which alterable information is stored. Suchcomputer-readable storage media, when carrying computer-readableinstructions that direct the functions of the disclosed embodiments, areembodiments of the present disclosure.

It will be appreciated to those skilled in the art that the precedingexamples are exemplary and not limiting. It is intended that allpermutations, enhancements, equivalents, and improvements thereto areapparent to those skilled in the art upon a reading of the specificationand a study of the drawings are included within the true spirit andscope of the present disclosure. It is therefore intended that thefollowing appended claims include all such modifications, permutations,and equivalents as fall within the true spirit and scope of theseteachings.

What is claimed:
 1. A method of transacting at an automated tellermachine (ATM), comprising: performing, by the ATM, an authenticationoperation by: receiving a plurality of audible indications from theuser, the plurality of audible indications corresponding to anauthentication input associated with the user, without audibly revealingany component of the authentication input; comparing the plurality ofaudible indications received from the user to the authentication inputassociated with the user; and upon determining that the plurality ofaudible indications correspond to the authentication input associatedwith the user, authenticating the user at the ATM; and performing, bythe ATM, a transaction operation by: generating an audio messagecomprising one or more transaction operations available at the ATM;receiving a second audible indication from the user to select at leastone transaction operation of the one or more transaction operationsavailable at the ATM, without audibly revealing the selected transactionoperation; mapping the second audible indication to a respectivetransaction operation in the set of transaction operations; andprocessing the selected transaction operation.
 2. The method of claim 1,wherein the authentication operation is triggered by: receiving, by theATM, a request to provide audio commands for input associated withauthenticating the user at the ATM.
 3. The method of claim 2, whereinreceiving, by the ATM, the request to provide the audio commands for theinput associated with authenticating the user at the ATM, comprises:receiving a request to enter a personal identification number (PIN) thatcomprises one or more digits, each digit corresponding a respectivecomponent of the one or more components.
 4. The method of claim 2,wherein receiving, by the ATM, the request to provide the audio commandsfor the input associated with authenticating the user at the ATM,comprises: receiving a request to enter a withdrawal amount.
 5. Themethod of claim 1, wherein the authentication operation furthercomprises: generating an audio message for each component of theauthentication input, wherein the audio message comprises a set ofcandidate entries for each component.
 6. The method of claim 5, whereingenerating the audio message for each component of the authenticationinput is triggered by identifying an auxiliary connection at the ATM. 7.The method of claim 5, wherein generating the audio message for eachcomponent of the authentication input is triggered by identifying aclient device of the user via wireless connection.
 8. A system,comprising: a processor in selective communication with a microphone anda speaker; and a memory having programming instructions stored thereon,which, when executed by the processor, performs one or more operationscomprising: performing, by the ATM, an authentication operation by:receiving a plurality of audible indications from the user, via themicrophone, the plurality of audible indications corresponding to anauthentication input associated with the user, without audibly revealingany component of the authentication input; comparing the plurality ofaudible indications received from the user to the authentication inputassociated with the user; and upon determining that the plurality ofaudible indications correspond to the authentication input associatedwith the user, authenticating the user at the ATM; and performing, bythe ATM, a transaction operation by: generating an audio messagecomprising one or more transaction operations available at the ATM;receiving a second audible indication from the user, via the microphone,to select at least one transaction operation of the one or moretransaction operations available at the ATM, without audibly revealingthe selected transaction operation; mapping the second audibleindication to a respective transaction operation in the set oftransaction operations; and processing the selected transactionoperation.
 9. The system of claim 8, wherein the authenticationoperation is triggered by: receiving a request to provide audio commandsfor input associated with authenticating the user at the ATM.
 10. Thesystem of claim 9, wherein receiving the request to provide the audiocommands for the input associated with authenticating the user at theATM, comprises: receiving a request to enter a personal identificationnumber (PIN) that comprises one or more digits, each digit correspondinga respective component of the one or more components.
 11. The system ofclaim 9, wherein receiving the request to provide the audio commands forthe input associated with authenticating the user at the ATM, comprises:receiving a request to enter a withdrawal amount.
 12. The system ofclaim 8, wherein the authentication operation further comprises:generating an audio message for each component of the authenticationinput, wherein the audio message comprises a set of candidate entriesfor each component.
 13. The system of claim 12, wherein generating theaudio message for each component of the authentication input istriggered by identifying an auxiliary connection at the ATM.
 14. Thesystem of claim 12, wherein generating the audio message for eachcomponent of the authentication input is triggered by identifying aclient device of the user via wireless connection.
 15. A non-transitorycomputer readable medium including one or more sequences of instructionsthat, when executed by one or more processors, cause the processors toperform operations comprising: performing, by the ATM, an authenticationoperation by: receiving a plurality of audible indications from theuser, the plurality of audible indications corresponding to anauthentication input associated with the user, without audibly revealingany component of the authentication input; comparing the plurality ofaudible indications received from the user to the authentication inputassociated with the user; and upon determining that the plurality ofaudible indications correspond to the authentication input associatedwith the user, authenticating the user at the ATM; and performing, bythe ATM, a transaction operation by: generating an audio messagecomprising one or more transaction operations available at the ATM;receiving a second audible indication from the user to select at leastone transaction operation of the one or more transaction operationsavailable at the ATM, without audibly revealing the selected transactionoperation; mapping the second audible indication to a respectivetransaction operation in the set of transaction operations; andprocessing the selected transaction operation.
 16. The non-transitorycomputer readable medium of claim 15, wherein the authenticationoperation is triggered by: receiving, by the ATM, a request to provideaudio commands for input associated with authenticating the user at theATM.
 17. The non-transitory computer readable medium of claim 16,wherein receiving, by the ATM, the request to provide the audio commandsfor the input associated with authenticating the user at the ATM,comprises: receiving a request to enter a personal identification number(PIN) that comprises one or more digits, each digit corresponding arespective component of the one or more components.
 18. Thenon-transitory computer readable medium of claim 16, wherein receiving,by the ATM, the request to provide the audio commands for the inputassociated with authenticating the user at the ATM, comprises: receivinga request to enter a withdrawal amount.
 19. The non-transitory computerreadable medium of claim 15, wherein the authentication operationfurther comprises: generating an audio message for each component of theauthentication input, wherein the audio message comprises a set ofcandidate entries for each component.
 20. The non-transitory computerreadable medium of claim 19, wherein generating the audio message foreach component of the authentication input is triggered by identifyingan auxiliary connection at the ATM or identifying a client device of theuser via wireless connection.